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Abstract 

Rice-Golomb codes are widely used in practice to encode integer-valued prediction residuals. However, in lossless 
coding of audio, image, and video, specially those involving linear predictors, the predictions are from the real domain. 
In this paper, we have modified and extended the Rice-Golomb code so that it can operate at fractional precision 
to efficiently exploit the real- valued predictions. Coding at arbitrarily small precision allows the residuals to be 
modeled with the Laplace distribution instead of its discrete counterpart, namely the two-sided geometric distribution 
(TSGD). Unlike the Rice-Golomb code, which maps equally probable opposite-signed residuals to different integers, 
the proposed coding scheme is symmetric in the sense that, at arbitrarily small precision, it assigns codewords of 
equal length to equally probable residual intervals. The symmetry of both the Laplace distribution and the code 
facilitates the analysis of the proposed coding scheme to determine the average code-length and the optimal value 
of the associated coding parameter. Experimental results demonstrate that the proposed scheme, by making efficient 
use of real-valued predictions, achieves better compression as compared to the conventional scheme. 

I. Introduction 

Prediction, which involves estimating the outcome of a data source given some past observations, is an effective 
tool for data compression. Consider the sequential encoding of an integer-valued source {Xt}, t = 0,1, . . ., over 
the alphabet {0, 1, . . . , (g — 1)}, on a symbol-by- symbol basis. In predictive coding, given the previously encoded 
data xo,xi,. . . , Xt-i, the value of Xt is predicted as xt and the residual et = Xt — Xt is encoded using an entropy 
code such as Huffman code or Rice-Golomb code. Since the previously encoded data are already available at the 
decoder, it can make the same prediction Xf and reconstruct Xt = Xt+Sf. The beneficial effect of prediction is that 
it decorrelates the data in the sense that the entropy of the residuals is significantly lower than the entropy of the 
original sequence. 

Although nonlinear predictions place no restriction on the predictor and thus can achieve better performance, linear 
predictions that restrict the predictor to be a linear function of the previously encoded data are much simpler to 
construct and analyze. Therefore, linear predictors are widely used in practice. In a order Unear predictive coding, 
after having observed the past data sequence xo,xi,. . . , xt-i, the value of xt is predicted as a linear combination 
of the previous a values as 

a-l 

xt = ^ ajxt-j (1) 

3=0 
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where Uj e M, < j < a, are real-valued predictor coefficients that need to be optimized. The most common 
measure of the performance of a predictor is the mean squared error (MSE). Therefore, the coefficients a*'s that 
minimize the MSE, i.e., — -'^IP], are considered optimal. In practice, however, a*'s are learnt by minimizing 

the MSE over a window of size oj such that 

LJ / a-1 \ ^ 

(ao,aJ, . . . ,a*_i) = argmin ^ \ xt-i - ^ a^xt-i-j \ . (2) 

(ao,ai,...,a„_i)eR° -^^ \^ .^^ J 

Note that even if the source values Xi's are integers, the predictor coefficients a*'s are drawn from the real domain 
M leading to real-valued predictions and residuals. 

Conventional predictive coding techniques round the real-valued prediction to the nearest integer [x] and then 
encode the integer residual 

e = X ~ [x]. (3) 

Given that [x] is also available at the decoder, we can restrict the number of possible values for e to g by taking 
into account the fact that x can only take values from the alphabet {0, 1, . . . , (q — 1)} (see [D). Now encoding 
of e using an entropy coder requires the knowledge of the probability distribution of the residuals. Therefore, 
in sequential symbol by symbol coding, the probabilities of the q possible residual values are also needed to be 
estimated adaptively. However, for a large alphabet there might not be sufficient number of data in practice to 
reliably estimate these probabilities. Hence, in practical applications, a parametric representation of the probability 
distribution of the residuals is often preferred H], 111. 

It has been observed that the distributions of the real-valued prediction residuals in audio 01, image jH, and 
video Is], ||6l coding highly peak at zero, that can be closely approximated by the Laplace distributions. A Laplace 
distribution, which sharply peaks at zero, is defined by the following probabiUty density function (pdf), 

= O<0<1, eeR. (4) 

Here is a scale parameter which controls the two-sided decay rate. Since conventional predictive coding schemes 
encode the integer-valued residuals, they model the distribution of e using a'discrete analog' of the Laplace 
distribution. A discrete analog of a continuous distribution with pdf ) has been proposed in |7| as a discrete 
distribution supported on the set Z having the probabihty mass function (pmf) 

= ^^oo (5) 

It follows from Q and (|5]) that the pmf of the discrete analog of the Laplace distribution takes the form, 

Pe{i) = ^e^^ zez. (6) 

i + tf 

which is known as the two-sided geometric distribution (TSGD) in the hteratme 

'The subscript t has been dropped for notational convenience. 
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Popular prefix coding schemes use Golomb codes |[8l to exploit the exponential decaying in the pmf of integer 
residuals. However, Golomb codes are optimal ||9l for the one-sided geometric distribution (OSGD) of the form 

pe{i) = {I - B)e\ i>{). (7) 

Given a positive integer parameter nig, the Golomb code of i has two parts: the prefix [?/"^gJ in unary representation 
and the reminder of that division, i mod nig, in minimal binary representation. The unary representation of a non- 
negative integer j consists of j number of ' 1 's followed by a '0' . The minimal binary representation of a non-negative 
integer k from the alphabet {0, 1, ... , mg — 1} uses [Ig m^J bits when k < 2 rig "si - mg or [Ig nig~\ bits otherwise. 
For a given 9, the optimal value of the parameter nig is given by 19| 

"■i^l . (8) 

-\ge 

Since Golomb codes are defined for non-negative integers only, popular Golomb-based codecs map the integer 
residual e into a unique non-negative integer prior to encoding by the following overlap and interleave scheme 
originally proposed by Rice in IJOJ . 

, . f 2e, £ > 0; 

AfR.ce(e) - S (9) 
I — 2£ — 1, otherwise. 

Although Golomb codes are optimal for geometrically distributed non-negative integers, the above mentioned scheme 
with Rice-mapping is not optimal for TSGD (see Q). Moreover, when the predictions are constrained to integers, 
a real-valued bias is typically present in the prediction residuals, which introduces a shift parameter /i in the TSGD 
model in addition to the parameter 6 (see Q and ITTl ). This off-centred TSGD is modeled by the following pmf ||2| 

pg,MW = gi-V+V ^''^'''' 0<Ai<i, (10) 

A complete characterization of the optimal prefix code for the off-centred TSGD has been presented in [2]. It divides 
the two-dimensional parameter space {6, /i) into four types of region and associates a different code construct with 
each type. However, the article admits that two dimensionality of the parameter space adds significant complexity 
to the characterization and analysis of the code. Moreover, being optimal for the off-centred TSGD, these codes do 
not preclude improving predictive compression gain further if the residuals could be handled in the real domain, 
minimizing the loss due to rounding. 

In this paper, we extend and modify the Rice-Golomb code so that it can handle the real-valued predictions at 
an arbitrary precision. More specifically, the contribution of the paper can be summarized as follows. Firstly, we 
generalize the Rice mapping (|9]l so that it can operate at an arbitrary precision. We then present the complete 
encoding and decoding algorithms based on the generalized Rice mapping. With the generalized Rice mapping, 
although the encoding is similar to Rice-Golomb encoding, the decoding is slightly convoluted. One of the salient 
features of the proposed coding scheme is that it is symmetric, i.e, when operating at finest precision, it assigns 
codewords of equal length to equally probable residual intervals. Secondly, assuming that the real-valued residuals 
are Laplace distributed, we analyze the proposed coding scheme and determine the close form expression for the 
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average code length. Thirdly, we determine the relationship between the scale parameter 6 and the optimal value 
of the coding parameter m when the code operates at the finest precision. The analysis of the proposed scheme to 
determine the average code-length and the optimal value of the associated coding parameter is greatly facilitated by 
the symmetry in both the Laplace distribution and the code. Although, for the codes operating at other precision, a 
relationship between and m is not readily available, we have demonstrated analytically and experimentally that 
a sub-optimal strategy incurs negligible redundancy at sufficiently small precision. 

The organization of the rest of the paper is as follows. The modified Rice-Golomb code is presented along with 
a novel generalized Rice mapping and the implementation details of the proposed coding scheme in Section [ll] In 
Section [nij the proposed scheme is then analyzed to determine its average code-length and the relationship between 
the scale parameter and the coding parameter m at different fractional precisions. Finally, experimental results 



ai^e presented to demonstrate the efficacy of the proposed coding scheme in Section IV 



II. Modified Rice-Golomb code at fractional precision 

In conventional predictive coding of integers, the real-valued predictions are first mapped to their nearest integers 
and then the integer-valued residuals are encoded with an entropy code. In order to extend the Rice-Golomb code 
to exploit the real-valued prediction at any arbitrary precision, let the scheme operate at precision p/r, where 
p and T are positive integers and p < t. Therefore, prior to residual encoding, the prediction x is rounded to 
[x]pfT — p\tx/ p + l/2j/r, which is the integer multiple of p/r nearest to x. Standard Rice-Golomb code is then 
an instance of this extended code with p = t = 1. 

Although [x]p/T can take any integer multiple value of p/r, using the fact that unknown x is from Z, the decoder 
can deduce that the residual Sp/^ = x — [xj^/^ will take integer-apart values in the form 

Ej = J + A/t where 7 e Z and A is a fixed integer from [0,t). (11) 

Example 1. When p — I and t — A, the possible residual values are integer multiples of p/r — 1/4, i.e., 
{0, ±1/4, ±2/4, ±3/4, . . .}. Let the prediction be x — 0.70, which must be rounded to the nearest multiple of 1/4, 
i.e., to [a;]i/4 = [4 x 0.70 + 1/2J/4 = 0.75. Once the rounded prediction [a;]i/4 is fixed to 0.75, the possible 
residual values are {. . . , —1.75, —0.75, 0.25, 1.25, . . .}, which are of the form = 7 + 1/4, 7 e Z. 

Now these integer-apart discrete residual values need to be mapped to unique non-negative integers so that they 
can be encoded using Golomb codes. For an efficient implementation of the code, the mapping also needs to be 
easy to compute. 

A. Residual mapping 

According to the Laplace distribution, small-valued residuals have higher probabilities than those of large-valued 
residuals. Since Golomb codes assign shorter-length codewords to small-valued non-negative integers, small-valued 
discrete residuals should be mapped to small-valued non-negative integers. In the above example, 0.25 should be 
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mapped to 0, and —0.75 should me mapped to 1 and so on. More generally, if A < t/2 then |eo| < < kil < 
|e_2| < • • • ■ Thus, should be mapped according to the function 

[27, 7 > 0; 

M^<r/2{e^) = { (12) 

I —27 — 1, otherwise. 

Similarly, if A > r/2 then |e-i| < |£o| < k-21 < l^il < • • • and consequently, e-y should be mapped according 
to the function, 

[27 + 1, 7 > 0; 

M^>r/2{e^) = \ ' - ' (13) 

I —27 — 2, otherwise. 



The mapping (12 1 and (13 1 require an explicit formula for the computation of 7 associated with the residual 
Ep/r — X — [x]p/T- It can be shown that 

£p/r = [ep/rj + [riSp/r " L^p/r J )J /t- (14) 

and 



Obviously, [Cp/^J G Z and [r(£p/^ — [ep/^-J )J < r. Therefore, it follows from ( 1 1 



14 1 that 



7 = [Ep/rJ (15) 

and 

A= Lr(e-p/r-L£p/rJ)J. (16) 
When these values of 7 and A are substituted in ( [T2| and ( [T3| ), both the mappings converge to the following, 

^ / L2£p/rJ, ifep/r>0; 

^(£p/r) = < ^ (17) 

I — [2ep/^J — 1, otherwise. 
Indeed this mapping is the generalization of the Rice mapping (|9jl. When p = t = 1, the residual Ep/r = £ is 
integer valued and thus [2eJ — 2e. In this case, the mapping ([iTji transforms into the Rice mapping (j9|. In the 
other extreme case of p/r 0, the prediction x is not rounded at all. For this asymptotic case of p/r — ?• 0, we 
will denote the mapping with M{e). 



B. Encoding and Decoding 

Having defined the mapping at precision p/r, the encoding operation is similar to the Rice-Golomb coding, however, 
the decoding operation is slightly convoluted due to the use of the floor function in the residual mapping ( [T7| . 

Encoding: Given the parameter value m, compute j and k as follows 

3^[M{ep,r)/ni\, (18) 

A: = M(ep/^) mod m. (19) 

Then encode j in unary and k in minimal binary. The pseudocode of the encoding algorithm is given in Algorithm [T| 
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Algorithm 1 Encoding 

Input: X e Z, i e M, r, p, TO e Z+ 
Output: j,k. 

[S:]p/T p[tx/p+1/2\/t; 

if Ep/r > then 

M(£p/r) L2£p/rJ; 
else 

M(£-p/,) := -L2£p/,J - 1; 
end if 

j := [A/(ep/^)/TOj; 

fc :— AI{ep/^) mod m; 

Encode j in unary and fc in minimal binary; 



Decoding: Given to, the decoder can compute M{ep/.,-) — jm + k. Given x, recovering x from Af(ep/^), however, 
is not straightforward. In Rice-Golomb coding, by checking whether M^iceiS) is even or odd the decoder can decide 
which of the constituent functions in ^ was used: if M^[cg{e) is even, it is the first constituent function that was 
used and e — AfRice(e)/2; otherwise the second constituent function was used and e = — (MRice(e) + l)/2. UnHke 
the Rice mapping (j9]), where the value of the first constituent function is always even and the second always odd, 
the value of the both constituent functions in the residual mapping ( [T7| can be even or odd. However, in conjunction 
with [a;]p/r, as explained below, it is possible to deduce from AI{£p^^) which of the constituent functions was used. 

Consider the first constituent function of the residual mapping ( [T7] i. 

M{ep/r) = [2ep/r\ 

= [2x-2[x]p/r\ 

= 2x~ \2[x]p/^] ■: X eZ. (20) 



It follows from (20 1 that the value of the first constituent function is even (odd) if \2[x]p/T-'\ is even (odd). Now 
consider the second constituent function of ( fTTj l 

M[ep/r) = -[2ep,r\-l 

= -Y2x-2[x\p,r\-l 

= -(2x + l)+ [2[i]p/^] -.-xeZ (21) 

It follows from (21 1 that the value of the second constituent function is odd (even) if \2[x]p/t'\ is even (odd). 
These relationships between [2[2;]p/7-] and M{ep/T-) are summarized in Table |l] These relationships now can be 
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TABLE I: The relationship between [2[i;]p/,-] and M{£p/t) 













Even Odd 




Even 


M(ep/r) = \ 2ep/A M{ep,,) = - \2ep,A - 1 


\mp/A 








Odd 





used to decide which of the constituent functions in the mapping ( 17 i was used. If both [2[i]p/^] and Miepj^) 
are even or both are odd then the first constituent function Mispi^) = [2ep/T-J was used; otherwise the second 
constituent function MiSpir) = — [2£p/T-J — 1 was used. Having decided on the constituent functions, the decoding 
now follows from ( |20] i and ( |2T] i as 



A./(.-,,.)+r2[x],;.1 ^ ^ ^2[£]p/,l is even; 

otherwise. 



r2[a]p/.i-Af(£p/j~i 



2 

The pseudocode of the decoding algorithm is given in Algorithm |2] 



Algorithm 2 Decoding 

Input: X e M, r, p, to € Z+, k 
Output: X 

M{ep/r) ■■= jm + k; 

[x]p/T p[tx/p+1/2\/t; 

if M(ep/^) + [2[a;]p/7-] is even then 
r2[a]p/^1+M(gp/0 . 

else 

r2[£]p/^1-M(ep/,)^l . 
X .— 2 ' 

end if 



III. Analysis 



In Section 



III-A 



the association of non-negative integers with different residual intervals by the mapping M{ep/r) 



is determined, which aids in computing the average code-length of the proposed coding scheme in Section [III-B| 
A. Association of non-negative integers to residual intervals 

When operating at precision p/r, a real-valued prediction x E [i + np/r — p/2t, i + np/r + p/2t), where i,n EZ 
and < n < T / p, is rounded to 

[x]p/r = i + np/r. (23) 
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Then for a given x, the residual takes the form 

= {x — i) — np/T. (24) 

Another relevant quantity is [2[x]p/7-] — \2i + 2np/T\, which is even if either n = or T/2p < n < t/ p and odd 
if < rt < T/2p. Now depending on M{ep/r) and [2[i]p/^], there can be four cases. 

Case 1: Both Af(ep/T-) and [2[x]p/^] are even 

In this case, M{ep/T) = 21 for some I G Z+ and either n = or r/2p < n < r/p. If n = then from Table |l] it 
follows that 

"'21 



2{x-i) - 2n- 

T. 



=^ 2{x~i)^2l •.■n = 

X = i + l. (25) 

Therefore, the non-negative integer associated with the residual x ~ x E {I — p/2t, I + p/2r] is 21. For example, 
when p ~ I and t = 4, the non-negative integer associated with the interval (—0.125, 0.125] is 21 = 0. 
On the other hand, if t/2/3 < n < r/p then from Table [l] it follows that 



2{x - i) - 2n- 

T. 



21 

^ 2{x-i)-2^2l ■: T/2p <n<T/p 

=^ x = i + l + l. (26) 

Therefore, the non-negative integer associated with the residual x — xG {l + l — np/r — p/2t, l + l — np/T + p/2t] 
is 21 where T/2p < n < r/p. For example, when p = 1 and r = 4, we only have n = 3 satisfying the condition 
2 < 71 < 4. Therefore, 2/ = must be associated with the interval (0.125, 0.375]. 

Case 2: M{ep/r) is even and [2[x]p/^] is odd 

In this case, M{ep/^) — 21 for some / e Z+ and < n < r/2p. Now from Table |l] it follows that 

- 2{x -i)- 2n-J -1 = 21 
=> -2{x-i) + l-l = 2l ■:0<n<T/2p 

=> x = i-l. (27) 

Therefore, the non-negative integer associated with the residual x — i G {—I — np/r — p/2r, ~l — np/r + p/2r] 
is 21 where < n < T/2p. For example, when p = 1 and r = 4, we have n = 1 and n = 2 such that < n < 2. 
Thus 21 = must be associated with the interval (—0.375, —0.125]. 

Case 3: M(ep/^) is odd and [2[a;]p/T-] is even 
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In this case, M(£p/^) = 21 + 1 for some / G Z+ and either n = or t/2/3 < n < t / p. If n = then from Table |l] 
it follows that 



2{x -i)- 2n- 

T . 



- 1 = 2/ + 1 
=^> -2(a; - - 1 = 2Z + 1 •.■ n = 

+> x = i-l-l. (28) 

Therefore, the non-negative integer associated with the residual x — x ^ {~l — 1 — p/2t, —I — 1 + p/2t] is 21 + 1. 
For example, when p = 1 and r = 4, the non-negative integer associated with the interval (—1.125, —0.875] must 
be 2/ + 1 = 1. 

On the other hand, if T/2p <n < r/p then from Table |l] it foUows that 

- 2{x - i) - 2n- -1 = 21 + 1 

T - 

=> -2{x -i) + 2-1 = 21 + 1 ■: T/2p < n < t/p 

^ x = i~l. (29) 

Therefore, the non-negative integer associated with the residual x — x ^ {—I — np/r — p/2t, —I — np/r + p/2r] is 
21 + 1 where t/2/9 < n < r/p. For example, when p = 1 and t = 4, we only have n = such that 2 < n = 3 < 4. 
Therefore, 2/ + 1 = 1 must be associated with the interval (—0.875, —0.625]. 

Case 4: Both M{ep/r) and [2[x]p/^] are odd 

In this case, M{ep/T-) = 2/ + 1 for some / e Z+ and < n < T/2p. Now from Table |l] it follows that 

2(x-i)-2n- =21 + 1 

T- 

^ 2{x-i) -1 = 21 + 1 •.•0<n<r/2p 

x = i + l + l. (30) 

Therefore, the non-negative integer associated with the residual x — x£ {l + l — np/r — p/2t, l + l — np/r + p/2t] 
is 2/ + 1 where < n < T/2p. For example, when p = 1 and t = 4, we have n = 1 and n = 2 satisfying 
< n < 2, and thus 21 + 1 = 1 must be associated with the interval (0.375, 0.875]. 

The assignment of non-negative integers by the mapping M (Sp/,-) to different intervals of the residual e for different 
values of p/r is depicted in Fig. [T] 

B. Average code-length 

It follows from Fig. [T] that when p = t = 1, which corresponds to the Rice mapping (|9]l, the assignment of 
non-negative integers to different residual intervals is asymmetric as equally probable, opposite-signed intervals 
are mapped to different integers. This asymmetry results in the assignments of codes of different lengths to 
equally probable residual intervals. However, this asymmetry reduces at finer precision and the assignment becomes 
symmetric in the asymptotic case of p/r — > 0. 
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I 3 I 1 I I 2 I 4 I ^Rice(^) 

1 « 1 1 1 1 1 

-2,5 -1.5 -0.5 0.5 1.5 2.5 S 

I M ^ I M ; " I ' I ^ M I ; I ^^-) 

-2.75 -2.25 -1.75 -1.25 -0.75 0.25 0.75 1.25 1.75 2.25 ^ 

I M M M ' I " I ' l ^ I ' I " I ^(^ '"^ 

-2.625 -2.125 -1.625 -1.125 -0.625 0.375 0.875 1.375 1.875 2.375 E 

|4 ^ 3|2 ^ 1| ^ I' ^ ^l-^ ^ '*! ^ "^^^ 

-2.5 -2.0 -1.5 -1.0 -0.5 0.5 1.0 1.5 2.0 2.5 £ 

Fig. 1: The assignment of non-negative integers to different residual intervals for different values of p/r. While at 
a coarse precision the assignment is asymmetric, at finer precision the assignment becomes symmetric. 



2 ~T "TOY T 2 

— p p— • • • p • p • • • — p p— £ 

III III 
III III 
III III 

I l + i + \gm I I l-^Igm / / l + i + \gm j IJs) 

III III 
III III 

ij^- • • • C^^ • C^^ • • • — C^^ £ 

(/ + l)m p _''"_P -H-JL —-J1. im _ p (i + l)m p 

2 2r 2 2r 2 2r 2 2t 2 2r 2 It 

Fig. 2: Association of code-lengths to different residual intervals when to = 2^, /3 G N. The code-length assignment 
shifts to the left by p/2t as compared to the symmetric assignment achieved at precision p/r — > 0. 



The analysis of the modified Rice-Golomb code becomes simpler for the asymptotic case of p/r ^ due to the 
symmetric assignment of non-negative integers to the residual intervals. It can be observed from Fig. [T] that the 
assignment corresponding to M{ep/^) is same as the assignment corresponding to M{e) but with a left shift of 
p/2t. When M{ep/T) is encoded using a Golomb code with a parameter to, this left shift is also reflected in the 
association of code-lengths with different residual intervals. Let for a given to, the length of the code associated 
with £ in the modified Rice-Golomb code at precision p/r be £m^(e)- In the asymptotic case of p/r — > 0, the 
code-length im^{s) will be denoted by imi^)- Now let us consider two cases depending on whether to is a power 
of 2 or not. 

1) The case to ~ 2^: When to = 2'^, the minimal binary representation of k always takes IgTO bits. In the 
asymptotic case of p/r -> 0, for any residual e such that iTO/2 < |e| < (« + 1)to/2, we get j — i. As the unary 
representation of i requires i + 1 bits, the length of the code associated with e is (e) = 1 + z + Ig to . Now it 
follows from Fig.|2]that the code-length associated with the left-shifted intervals {im/2 — p/2t, {i + l)m/2 — p/2t) 
and (— (i + 1)to/2 — p/2r. —im/2 — p/2r) is £„i(£) = 1 + i + IgTO. This observation can be used to determine 
the average code-length of the modified Rice-Golomb code. Given to, let the average code-length achieved with 
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the modified Rice-Golomb code, in encoding residuals that are Laplace distributed with parameter 0, be Vjj [ff). 
Then we have the following theorem. 

Theorem 2. //to = 2'^,/? e N, then L^Jj {0) = 1 + lgm+ 53^^(61^ +9-^). 
Proof: We have 

/+00 
tPjj{e)fg{s)de 
-00 

/O /'+00 
lt{e)fe{e)de+ / l^J,^ {e)fe{e)de 
-00 JQ 

= m^-{e) + LPi^+{e) (31) 

where L'^rl''^ {6) = i^d^ {e)fg{e)de and L^^+(6i) = /+°° e?^^ {e)fg{e)de. Now it follows from Fig. ji] that 
Substituting e = —e, 

L?l^^i0)^-^ (l + lgm)rd£-— ^ / (l + i + lgTO)rde. (33) 



2 2-r i — 1 



After algebraic simplification, 



i—1 2 ^ 2t 



1 nm/2 

1 ^ /ip/2r 



(34) 



From Fig. |2] it also follows that, 

= - ^ / ' " (1 + lg"i)^''^£ -\Y. iS " (1 + * + lg™)^'rfe- (35) 
After algebraic simplification. 



i— 1 2 2t 



1 1 f^Tfljl 

LpJj+(9) = -(i + igTO) + j^e~p^^\ 

m \ J 2 ^ ^ '21 — 



(36) 



Now from (31 1, (34i, and (RSI we get 



1 fj^it-l'^ 

(0) = 1 + ig TO + - (e''/^- + e-pi^^) 



□ 



2) TTze case to 7^ 2^.- First we determine the code-lengths associated with different intervals for the asymptotic 
case of p/t 0. Let T = 2r's™l — 771 — l. When to ^ 2'^, the minimal binary representation of k takes [IgTOj 
bits if < fc < T or [Ig to] bits if T + 1 < k < m. Thus the minimum possible code-length in this scheme 
is 1 + [Ig toJ which results when j = and < fc < T. Since T + 1 < to, this minimum code-length is 
associated with the interval (-(T + l)/2, (T + l)/2). Beyond the point |e| = (T + l)/2, we have fc > T and 
the minimal binary representation of fc requires [IgTO,] bits. However, the value of j remains for jej < to/ 2. 
Therefore, ^m(e) = 1 + [IgTO] for {T + l)/2 < |e| < to/2. Now, when |e| > to/2 although j becomes 1 whose 
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r + 1 (i + l)m T+l im T + 1 T + 1 j-^j j+i (i + i),„ 

2 2 2 2 2 2 ^^"^T 2 ^ 2 

— O (D— • • • O • O • • • — O s 

III III 
III III 
III III 

/ 2 + (- + [lgmJ / / 1 + LlgmJ / /2 + i- + [lgmJ/ l is) 

II I I I I "' 

^— • • • • ^ • • • —<i- e 

_r + l_(i + l)m_ p r+1 im p T + 1 p Q T +1 p T + l itn_P_ T + l (i + l)m p 

~ ~ Tt 2 2 It 2 2t 2 2t 1 1 It ~Y' 2 2r 

Fig. 3: Association of code-lengths to different residual intervals when m ^ 2^ for any integer /3 > 0. The 
code-length assignment shifts to the left by p/2r as compared to the symmetric assignment achieved at precision 
p/t 0. 



unary representation requires two bits, the value of k becomes < T for m/2 < \e\ < m/2 + {T + l)/2. Hence, 
£m{£) = 2 + [IgmJ for m/2 < |£| < m/2 + {T + l)/2. Since, when m ^ 2^, we have [Igm] 1 + [IgmJ, for 
(T + l)/2 < |e| < m/2 + (T + l)/2, we get im{e) = 2 + [IgmJ. This calculation is generahzed in the following 
lemma. 

Lemma 3. Given m ^ 2^^ for any integer /3 > 0, if '^-^ + ^ < e < "^-^ + ^^^2"*™ then the code-length associated 
with e is £,„ {e) — 2 + i+ [Ig mj . 

Proof: See Appendix [A] □ 

Now for an arbitrary p/r, the minimum code -length 1 + [IgmJ is associated with the left-shifted interval (— (T + 
l)/2 — p/2t, [T + 1)/2 — p/2t) and the code-length 2 + i + [IgmJ is associated with the left-shifted intervals 
((T + l)/2 + Mn/2 - p/2t, {T + l)/2{i + l)m/2 - p/2r) and {-{T + l)/2 - {i + l)m/2 - p/2t,-{T + 1)/2 - 
im/2 — p/2t). Using these association of code-lengths to different intervals as depicted in Fig. [3] we can determine 
the average code-length L'iJj {0). 

Theorem 4. Ifm^ 2^ for any integer /3 > then L^'Jj {Q) = 1 + [IgmJ + 5 y=¥^(^* +6*-^). 
Proof: We have 

/+00 
lPl:{e)fe{e)de 
-00 

^0 /'+00 



lfJ:{e)fe{e)de+ / 1^!: {e)!e{e)de 
1 Jo 

= (37) 
where L''Jj^{e) = f_^ejj (e)je{e)de and L^^+(6l) = tjj {e)fe{e)de. Now it follows from Fig.|3|that 

LP/^-W^-if /^_^(l+[lgmJ)^-^de-i|^E /_^/J^^:^(2 + ^+LlgH)^-^rf- (38) 



'27 i=0 ' 
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Substituting e — —e. 



After algebraic simplification, 

1 1 6)(T+l)/2 

= 2(1 + LigH) + 213^^ ^^^^ 

From Fig. |3] it also follows that, 

. „ (r+i) __e_ 00 „ (^+i) + ('+!)"' _JL 

L':/J^ (0)^-^1 "(l+LlgH)^^rfe-^E L ' ' "(2 + z+LlgmJ)rde. (41) 
After algebraic simplification. 



1 1 6l(^+i'/2 
-(l+|lgm|) + ^6'-''/2^ 



(42) 



Now from ([STJ, (|40|l, and (g2]i we get. 



□ 



From Theorem [2] and Theorem |4] the average code-length of the modified Rice-Golomb code can be summarized as, 

1 + Ig m + i (r/^r + 5/-p/2r ) ^ if m = 2^^ , /3 e N; 
- { '" Vri«^i-„ (43) 

1 + [IgmJ + (^^^^^ + 6'"''/^^), otherwise. 



Let denote the asymptotic average code- length with L„i{9). Then it is immediate from (43 1 that 



l + lgTO+-^, if TO = 2'^,/3 G N; 
Lrnid) = { '' Vr.«^i_„ (44) 

1 + [IgmJ + — , otherwise. 

1 — 2 

C. Optimal value of m 

We first determine the association between 6 and the optimal value of to for the asymptotic case of p/r — > 0. The 
relation between the optimal value of to and 6 is summarized in the following theorem. 

Theorem 5. When p/r — > 0, the value of m is optimal for S [4>m-i^ 4>m] where (j)m is the unique root of 
f^{(j)) = + 0™ - 1 in the interval (0, 1). 

Before proving the theorem, we present few lemmas that will aid in the proof of the theorem. 
Lemma 6. /„(</') — (/)"+^ +</>" — 1 = 0, n > 0, has a unique root in (0, 1). 
Proof: See Appendix [B] 

Lemma 7. cfy^ is a unique root of Lj^lO) = in (0, 1), where is the unique root of (f)"^~^^ +0™ — 1 = 

in (0, 1). 



October 26, 2012 



DRAFT 



13 




0.1 0.2 0.3 0.4 0.5 0.6 
Scale parameter, 9 



0.7 



Fig. 4: £1(6*) intersects L2i9) at point 0f = 0.3820 from above and L2{6) intersects £3(6') at point 02 = 0.5698 
from above. Therefore, Li{9) < L„i{9) for all m 7^ 1 and m. = 1 is optimal for 6 E (0, 0.3820]. Similarly, m = 2 
is optimal for 6 e [0.3820,0.5698]. 



Proof: See Appendix [C] 



Lemma 8. 0„i+i > 



Proof: See Appendix [P] 



Lemma 9. lime^o Ljn+i{0) > lime^oLmiO). 
Proof: See Appendix |Ej 
Now we prove Theorem |5] 

Proof: By Lemma [t] Lm-i{0) and Lm{0) intersect only at one point 0^„„i in (0,1). According to Lemma |9] 
Lm{0) intersects L„i^i{d) from above at point (pf^-i (see Fig. |4|. Therefore, 



L„(0) <L„_i(0), foT0>^l_,. 
Similarly, L„i_|_i(6') intersects Lm{0) from above at point (/)^„ (see Fig. [4|l and 

Lra+li0) < Lraie), for0>0^. 



(45) 



(46) 



Now ipm > 4>m-i according to Lemma [s] and thus L„i{0) < Lm'{0) in the interval (fin^i 1^ 1^ 4'm foi" 
m' 7^ m. □ 



Although the above theorem states the relationship between the optimal value of m and the scale parameter 9, the 
determination of the optimal value of m, for a given value of 6, is not straightforward. Therefore, we propose the 
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TABLE II: The maximum redundancy incurred by the sub-optimal use of mg for 6 — 0.01, 0.02, 0.97 at 
different precisions. 





Precision, p/r 






4/5 1/2 1/4 


1/8 1/16 


1/32 


Maximum redundancy(%) 


22.34 7.39 1.72 


0.42 0.11 


0.03 



following indirect method. Given m, the interval of 9 in which m is optimal is determined and stored in a table. 
Then for a given value of 9, finding the optimal value of m will involve a table look up. Since optimal value of 
TO increases with 9, the logarithmic time binary search algorithm iTHll can be used for searching the table. For 
example, if 32 intervals corresponding to 32 values of to are stored in the table, only five comparisons are required 
to determine the optimal value of to for a given 9. In practice, however, only a few intervals need to be stored. 
For instance, to = 32 is optimal for 9 e (0.9782, 0.9789), which corresponds to an almost uniform distribution. 
However, if the predictions are good enough, the distribution will be highly peaked at zero and the value of 6 will 
be small. Therefore, if only the first few intervals are stored in the table and the maximum value of to in the table 
is used when 9 falls outside the range of the table, there will be negligible loss of compression efficiency. 

Although a relationship between 9 and the optimal value of m exists for the asymptotic case of p/r —> (Theorem[5|l, 
no such closed form expression is readily available for an arbitrary value of p/r. For a given 9, let me be the 
optimal value of to at the asymptotic precision of p/r — > 0. Now we can demonstrate that, if the coder operates at 
precision p/r and uses me as the coding parameter, then it incurs a negligible redundancy when p/r is sufficiently 
small. When operating at precision p/r, let the sub-optimal use of me results in a redundancy of Ap^'^{9) bits as 



compared to Lmg{9). Then it follows from ( |43| and (44i that 

f -^^(6"'/^^ + 9-P/^-' - 2), if TOe = 2^5, /3 G N; 
A''/^9)^\ ^ ;rl^.i_„, (47) 
e %_ — (gip/2r ^ 5,-p/2r _ 2), Otherwise. 

V 1-9 2 

The maximum redundancy incurred by the sub-optimal use of me as compared to L,,,^ {9) at different precisions, 
is shown in Table |ll] It follows from the table that, although at coarse precision this sub-optimal strategy results in 
significant redundancy, at finer precision the maximum redundancy becomes negligible. For example, at precision 
1/16, the maximum redundancy is bounded below 0.11%. 



D. Empirical estimation of 9 and optimal value of m 

Besides having superior performance, the proposed scheme has several advantages over the Rice-Golomb code 
when simplicity of implementation is of concern. The Rice-Golomb code being asymmetric is difficult to analyze. 
There are no known method for the determination of the optimal choice of mg, for given a 9. Even the evaluation 
of the average code-length is difficult for the Rice-Golomb code. The Rice-Golomb code, first proposed in |10|, 
divides the data sequence into blocks of length J and computes the code-lengths using a set of values for the 
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parameter mg. The value of that yields the shortest code-length is then selected. This approach involves a delay 
at the beginning of the transmission of each block and thus is not suitable for on line symbol-by-symbol encoding. 
Moreover, Rice-Golomb code also requires the explicit transmission of the chosen parameter value nig. In contrast, 
the of a Laplace distribution can be estimated on line from previously encoded sequences Xq,Xi, . . . ,Xt-i and 
the corresponding predictions xo,xi, . . . ,Xt-i, as 9 — e^*/'^' where St = J2o~^ ~ ~ X^o"^ k«l (^^^ [13j)- 
From the estimated 6, the optimal value of m can be determined using the strategy described in the previous 
sub-section. Since, the decoder also has the same previously encoded data and the corresponding predictions, it can 
also estimate the same 9 and determine the same value of m. 

IV. Comparative Analysis and Experimental Results 

In this section we first demonstrate that the proposed coding scheme at precision p/r — > always outperforms 
the Rice-Golomb code. Then we presents simulation results to analyze the performance of the proposed scheme at 
different values of the scale parameter 9 and precision p/r. 

The code-lengths in different residual intervals for 9 — 0.1,0.4, and 0.6 are shown in Fig. |5]for both the Rice- 
Golomb scheme and the proposed scheme. Optimal values of the coding parameter for 9 — 0.1, 0.4, and 0.6 are 1, 
2, and 3 respectively for both the schemes. While, in the proposed scheme, the code-length of a positive residual is 
equal to that of its negative counterpart, Rice-Golomb coding uses shorter codes for negative intervals than equally 
probable positive intervals. Consequently, the lengths of Rice-Golomb codes for some negative intervals are one 
bit shorter than the proposed codes; while the opposite holds for some of the positive intervals. However, for each 
negative interval for which the Rice-Golomb code is shorter, there exists a positive interval of a higher probability 
for which the proposed code is shorter For example, in Fig. |5|a), the first interval on the negative side in which the 
Rice-Golomb code is shorter than the proposed code is ( — 1.5, —1) having probability 0.0342. On the other hand, 
the interval (0.5, 1) of probability 0.1081 is the first positive interval in which the proposed code is shorter than 
the Rice-Golomb code. With increasing 9, however, the differences between the probabilities of positive intervals 
in which the proposed code is shorter and of the corresponding negative intervals in which Rice-Golomb code is 
shorter is reduced. Therefore, the coding gain of the proposed scheme over Rice-Golomb code also diminishes with 
increasing 9. For example, in Fig. |5jb) and Fig. |5]^c) the first positive intervals in which the proposed scheme has 
shorter code-length are (0.5, 1) and (1.5, 2), having probabilities 0.1162 and 0.0524 respectively. The corresponding 
negative intervals in which the Rice-Golomb scheme has shorter codes are (—1.5,-1) and (-2.5,-2), having 
probabilities 0.0735 and 0.0406 respectively. 

In order to empirically evaluate the coding performance of the proposed modified Rice-Golomb scheme, 9 in 
the range [0.01,0.97] with increments of 0.01 and p/r = 1,4/5,1/2, 1/4,1/5,1/8,1/16, and p/r were 
considerecj^ First we randomly generated 100,000 uniformly distributed integers in the range [0, 127]. For each of 

^ We implemented the codecs in MATLAB, whose default spacing of floating point numbers is 2.2204e — 016. Thus in our implementation 
p/t = 2.2204e - 016 corresponds to p/r — !> 0. 
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9 = 0.1, 



Rice— Golomb code 
Probability 
I. _ .1 Proposed scheme( p/x — > 0) 



0.5 
0.45 
0.4 
0.35 



o 

0.2 £ 
0.15 
0.1 
0.05 



-2.5 -2 -1.5 -1 -0.5 0.5 1 1.5 2 

e 

(a) 

9 = 0.4, m = 2 



2.5 



Hp 



■§2 



■ 



Rice-Golomb code 
[Probability 

Proposed scheme( p/x — > J 



-0.2 
-0.18 

0.16 

0.14 

0.12 I" 

0.1 _§ 
o 

0.08 (£ 
0.06 
0.04 
0.02 



-1 -0.5 0.5 1 1.5 

8 

(b) 

8 = 0.6, m = 3 



2.5 



"0 



Rice— Golotnb code 
Probability 

Proposed scheme( p/x — > 0) 



-1 -0.5 

8 

(c) 



0.5 



1 1.5 



2.5 



Fig. 5: Probability of intervals and the codes associated with them in both the Rice-Golomb coding and the 
proposed coding scheme (operating at precision p/r 0) for Laplace distributed residuals for different values 
of 9; (a) e = 0.1; (b) = 0.4; and (c) = 0.6. 
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TABLE III: The average code-lengths achieved by the proposed modified Rice-Golomb scheme at different 
precisions. 









Laplace scaling 


parameter, 


e 




Precision 


0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


p/t 


= 1 


1.54311 


1.90271 


2.2863 


2.73099 


3.06241 


3.46446 


p/t = 


4/5 


1.52707 


1.87203 


2.25887 


2.6775 


3.01546 


3.4579 


p/t = 


1/2 


1.53743 


1.88422 


2.26756 


2.67853 


3.01862 


3.46191 


p/t = 


1/4 


1.47973 


1.83218 


2.2252 


2.66607 


3.00676 


3.45299 


p/t = 


1/5 


1.4634 


1.81974 


2.2114 


2.66405 


3.0049 


3.45013 


p/t = 


1/8 


1.4643 


1.82011 


2.21453 


2.66147 


3.0028 


3.44999 


p/t = 


1/16 


1.46171 


1.81674 


2.21041 


2.66053 


3.00278 


3.44938 


p/t - 


-4 


1.46074 


1.81614 


2.20911 


2.66071 


3.00176 


3.4499 


Lmg 




1.46248 


1.80902 


2.21103 


2.66667 


3 


3.44719 



the integers, the associated real-valued prediction was generated so that the residuals are Laplace distributed with 
parameter 0. Then for each 6, the optimal value of the parameter m for Rice-Golomb coding, which corresponds to 
coding at precision p/r = 1, was determined exhaustively. However, in choosing the value of the coding parameter 
m at other precisions, we adopted the sub-optimal strategy as discussed in the previous section. More specifically, 
for a given 6, we used the same mg for p/t = 4/5, 1/2, 1/4, 1/5, 1/8, 1/16, where mg is the optimal value of m 
at precision p/r ^ 0. 

Table [III] presents the average code-lengths achieved with the proposed coding scheme operating at precisions 



p/r ~ 1,4/5,1/2, 1/4,1/5,1/8,1/16, and p/r for some values of 9. In Table III we have also shown 
Ljng{9), the average code-length achievable at precision p/r ^ using the optimal value of the parameter m as 



computed in (44 1. At any precision, the average code-length increases with 6. While for a fixed 9, the average 
code-length decreases as the precision becomes finer, the coding gain over integer precision Rice-Golomb code 
diminishes with 9. Note the apparent anomalies at precisions 4/5 and 1/5. The code-lengths at these precisions 
are smaller than those at precisions 1/2 and 1/8, respectively. When the rounded prediction [x]p/^ is an integer 
multiple of 1/2, there are pairs of equidistant integers from the prediction; but for each pair, the integer with negative 
residual is ranked ahead of that with positive residual by the generalized mapping ([TTjl. As both the integers are 
equiprobable, such a bias is inevitable, and ultimately reduces the coding efficiency. While precisions 4/5 and 
1/5 avoids rounding the predictions to any multiple of 1/2, this is not the case when r = 2^. Despite having 
this limitation, the compression achievable at precision 1/16 is almost the same as the compression achievable at 
precision p/r ^ 0. 

In Fig. |6j we present the redundancy incurred by the proposed coding scheme operating at precisions p/r = 
1,4/5,1/2, 1/4,1/5,1/8,1/16 as compared to the average code-length Lmg{9) achieved at precision p/r 0. 
It follows from the figure that, in coding Laplace distributed residuals, the redundancy incurred by the original 
Rice-Golomb coding scheme with unit precision is as high as 5.75%. 



October 26, 2012 



DRAFT 



18 



7 



Precision { p/x) = 1 

Precision ( p/x) = 4/5 



V 



Precision (p/x) = 1/2 
Precision (p/x) = 1/4 
Precision (pA) = 1/5 



6 




0, 



Precision (p/x) = 1/8 

Precision (p/x) = 1/16 







0.1 



0.2 



0.3 



0.4 0.5 0.6 



0.7 



0.8 



0.9 



1 



Scale parameter, 6 



Fig. 6: The redundancy of the modified Rice-Golomb code at precision p/r = 1,4/5, 1/2, 1/4, 1/5, 1/8, 1/16 as 
compared to the average code-length achieved at precision p/r ^ 0. 



In this paper, the Rice-Golomb predictive coding scheme has been modified so that it can exploit the real-valued 
predictions more efficiently. It has been shown theoretically and demonstrated empirically that the proposed scheme 
achieves better compression as compared to the Rice-Golomb code when the predictions are from the real domain. 
Besides better compression, the proposed scheme has several advantages over the Rice-Golomb code when simplicity 
in analysis and implementation is of concern. Firstly, the proposed scheme allows the residuals to be modeled using 
the Laplace distribution instead of its discrete counterpart - the two-sided geometric distribution. Secondly, the 
proposed coding scheme operating at finest precision is symmetric in the sense that it assigns codewords of equal 
length to equally probable residual intervals. The symmetry in both the Laplace distribution and the coding scheme 
facilitates the analysis of the code to determine the average code-length and the optimal value of the coding 
parameter. Finally, the scale parameter of a Laplace distribution can be easily estimated online from the previous 
residual values. Thus we do not need to explicitly transmit the coding parameter m, as both the encoder and the 
decoder can estimate the same 6 and choose the optimal m given the estimated value of the 6. 



V. Conclusions 
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Appendix 

A. Proof of Lemma [i] 

Let divide the interval [{T + l)/2 + ini/2, (T + l)/2 + (z + l)m/2) into two sub-intervals [{T + l)/2 + im/2, {i- 

l)m/2) and [{i + l)m/2, (T + l)/2 + {i + l)m/2). Then for the first sub-interval we get, 

{i + l)m 
2 

< I + 1 



T+l 


+ 


im 


2 




{T+l) 
m 


,.<^ 

m 


2\e\ 




3 = * 


m 





and 

T + l im (i + l)m 

1 < e < ^ — 

2 2 - 2 

{T + 1) + im < [2\e\\ < {i + l)m 
^ (T+ 1) < [2|e|J mod m = < m. 

In this sub-interval, representation of j and k require 1 + i and [Ig m] = 1 + [Ig m\ bits respectively. Hence, 

„ , , ,, , T+l im (i + l)m 

£^{e) = 2 + i+[\gm\, when + — < £ < ^ ^ ^ . (48) 



For the second sub-interval we get. 



(i + l)m T + l (i + l)m 

— < e < h ^ — 

2 - 2 2 

2\e\ T+l , 

^ z + 1 < < + {i + 1 

m m 



2|e| 



=2=1+1 



and 



{i + l)m T + l (i + l)m 

— < e < h ^ — 

2 - 2 2 

{i + l)m<2\e\<{T + l) + {i + l)m 
=^ < [2|e|J mod m = A; < r+ 1. 

Thus, in the second sub-interval, representations of j and k require 2 + i and [IgmJ bits respectively. Hence, 

„ , (i + l)m T+l (i + l)m 

= 2 + z + [IgmJ, when ^ ^ ' < e < -— + (49) 



Thus, when ^ -l ™ 



2 < e < ^ + from (|48|l and (|49|l we get. 



^™(e) 2 + i + [IgmJ. 



□ 
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B. Proof of Lemma |6| 
Proof: 

(n+ 1)0" + n0' 



Since e (0, 1) and n > 0, ^^^^^^ > 0. Therefore, /„(</)) is a monotonically increasing function of (\). Moreover, 
lim^^o /ri(0) = —1 and lim0^i /„(</>) = 1 and thus /„(</)) can have only one solution in (0, 1). □ 



C. Proof of Lemma [7| 

Proof. There are four possible cases. 

Case 1: to = 2^^ for some integer /3i > and to + 1 = 2*^^ for some integer j32 > Pi 
The only possible value of to for this case is 1. Therefore, 

Li{e) = L2{e) 

^ (1 - 6*1/2) (6» + 6|i/2 _ 1) = 0. 
Since < 6* < 1, (1 - 9^^^) ^ and thus 

e + e^/"^ -1 = 0. 

Substituting 6'^/^ = 4> yields, 

02 + _ 1 ^ 0. 

Case 2: to = 2'^^ for some integer /3i > and m + 1 7^ 2^^ for any integer /32 > 
In this case, [lg(TO + 1)J = IgTO and [lg(TO +1)] = IgTO + 1. Therefore, 

^rn ^ 2ri«('" + l)1 -(„ + !) 

l + lgm+- W = 1 + Llg(»7i + 1)J + 



^ 0"'+^ + 0" - 1 = where = e^/^. 

Case 5.- to 7^ 2*^^ for any integer /3i > and to + 1 = 2'^^ for some integer /32 > 
In this case, [IgTOj — lg(TO + 1) — 1 and [Igm] = lg(TO + 1). Therefore, 

6* 2 

^ 1+Llgmj+- = l + lg(TO+l) + —r 

^ (/>™+i + 0" - 1 = where = 6*^/2^ 

Case 4: m ^ 2*^^ for any integer /3i > and to + 1 7^ 2^^ for any integer /32 > 
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In this case, [IgmJ — [lg(m + 1)J and [Igm] = [lg(m + 1)]. Therefore, 

2riS"i1 _^ 2rl8("^+l)1 

^ 1 + [IgmJ + ' = 1 + [lg(m + 1)J + 

^ 0'"+^ + 0™ - 1 = where = ^^/^^ 
Since according to Lemma |6] 1 = has a unique solution in (0, 1), the lemma follows. □ 



D. Proof of Lemma [S] 

Proof: Since (f>rn is the root of the polynomial, (/)'"+^ + c/)™ — 1 = 0, 
Since < < 1, 

" < 1. (50) 



,m+l 



However, (j)m+i is the root of the polynomial, cj)^^^'^ + cjf^^^ — 1 = and thus 



cIi+c:hi- (51) 



It follows from (50 1 and (51 1 that (pm+i > 4>m- D 



E. Proof of Lemma [9] 
Proof: For all m > 1, 



limL™((?) = 1+ [IgmJ. (52) 



Since [lg(m + 1)J > [Ig mj , the lemma follows. □ 
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